CBT Campus' Online Skills Training Courses.

IT Skills

Enterprise Database Systems

Big Data

Apache Storm Introduction

df_apsi_a02_it_enus

df_apsi_a01_it_enus

Apache Storm Introduction - API and Topology

Course Number:
df_apsi_a02_it_enus

Expected Duration (hours)
3.1

Lesson Objectives

Apache Storm Introduction - API and Topology

start the course
use Trident for a simple topology
describe topology state management with Trident
describe the different types of Trident spouts available for implementing fault-tolerant Trident state management
describe the different Trident State APIs available for implementing fault-tolerant Trident state management
describe distributed RPC model and how it is used with Apache Storm
describe DRPC modes of operation and topology types
deploy a Trident topology to a Storm cluster
describe the Storm UI home page
launch a Storm topology to a local cluster and view cluster activity in the Storm UI
analyze a Storm topology using the Storm UI
describe the process of using the Nimbus Thrift client for obtaining Storm cluster metrics
set up a Maven project in Eclipse IDE that can be used to write Java client code for connecting to a Nimbus Thrift server
write Java client code that connects to a Nimbus Thrift server and retrieves Storm cluster statistics
describe the general architecture of Apache Kafka
describe Kafka components and data model
produce and consume a Kafka topic
consume Kafka messages in a Storm topology
describe some options for using Storm's Core APIs to implement micro-batching in a Storm Core topology
describe Apache Hadoop's use with Storm
download and install Apache Hadoop on a development machine
describe how Apache Storm applications can be run on Hadoop YARN clusters to leverage YARN resource management
describe the Puppet architecture and some key framework components
describe how JMX and Ganglia can be integrated and used to monitor Storm clusters
describe how HBase and Redis can be integrated and used as datastores with Apache Storm
integrate and use JMX in Storm to obtain Storm Nimbus and Supervisor metrics
demonstrate increased knowledge of configuring and installing Apache Storm

Overview/Description
Storm makes it easy to stream massive amounts of unbounded streams of data while providing a fault-tolerant system. This system accompanied with other integrations can take any business to the next level. In this course, you will be introduced to Trident and its relationship to Storm, along with its integration. Many other integrations will be explored including Hadoop, Kafka, JMX, Ganglia, and even automation using Puppet as well as monitoring and analytics tools. In this course, you will also learn how to deploy the Storm architecture.

Target Audience
Individuals interested in how to perform an installation and configure it, as well as design and create basic Storm topologies

Apache Storm Introduction – Architecture and Installation

Course Number:
df_apsi_a01_it_enus

Expected Duration (hours)
2.8

Lesson Objectives

Apache Storm Introduction – Architecture and Installation

start the course
describe in a higher scope, Apache Storm and its characteristics
describe why Apache Storm is used
describe the Apache Storm Architecture
identify a tuple and a bolt and their use in Storm
identify a spout and its use in Storm
identify streams and their use in Storm
describe the different operation modes of Storm
identify Storm components and their functionality in the source code for an example Storm application
describe the setup process for an Integrated Storm development environment
use Maven to compile and run a Storm application
describe the installation and setup process for ZooKeeper as a standalone server
install and set up ZooKeeper on a development machine
deploy a ZooKeeper server in standalone mode and test it with a ZooKeeper client connection
describe the process for setting up and deploying a ZooKeeper cluster
demonstrate the process of setting up a production Storm cluster
describe the process of configuring the parallelism of a topology
configure the parallelism of spout and bolt components in a Storm topology
describe briefly stream groupings and their types
use stream groupings in a Storm topology
describe the Guaranteed Messaging Process
describe the fault-tolerant characteristics of Storm
describe briefly what Trident is and how it's used
describe Trident's data model and its use
describe several operations of Trident
test your knowledge of Apache Storm and the components of the system

Overview/Description
Apache Storm is a fast and scalable open source distribution system that drives real-time computations. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun. This course will introduce Storm providing an in depth approach to the concepts and how Apache Storm ticks. Apache Storm can integrate with many other products and this course will introduce you to ZooKeeper along with the Storm Trident Architecture and how they are used with Storm model.

Target Audience
Individuals interested in learning the architecture of Apache Storm and how it can be used.